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AMENDMENTS TO THE SPECIFICATION: 
Please amend paragraph [0037] as follows: 

FIG. 1 illustrates the general method for filtering relevant information from a corpus of 
documents 101 or from a st e am stream of documents 130. According to one embodiment of the 
invention, a document is a text file containing one or more strings of characters or other symbols 
that are used to form more complex constructs. For example, strings of characters may form 
words, phrases, sentences, and paragraphs. The constructs contained in the documents are not 
limited to constructs or forms associated with any particular language. Furthermore, the feature 
types that may be used are not limited to linguistic strings or symbols, but can include other 
feature types such as: structural features, such as the number of fields or sections or paragraphs 
or tables in the document; physical features, such as the ratio of "white" to "dark" areas or the 
color patterns in an image of the document; annotation features, the presence or absence or the 
value of annotations recorded on the document in specific fields or as the result of human or 
machine processing; derived features, such as those resulting from transformation fiinctions such 
as latent semantic analysis and combinations of other features; and many other feature types that 
may be apparent to practitioners of the art. 
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Application No. 10/713,645 
Amendment Dated 16 October 2007 
Reply to Office Action of 1 9 July 2007 

Please amend paragraph [0049] as follows: 

In This table, the Terms column lists a unique union of all the terms contained in the two 
documents Di and D2. The values listed under Di and D2 refer to the frequency of those terms as 
■they appear in the documents. The values listed under Qi and Q2 refer to the frequency of the 
terms as they appear in the profiles corresponding to Qi and Q2. Using the similarity function 
illustrated above, the similarity score of the profile Qi to document Di is computed as: 
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